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The two surface glycoproteins S and HE of human coronavirus OC43 (HCV-OC43) were isolated from the viral 
membrane and purified. Only the S protein was able to agglutinate chicken erythrocytes, indicating that this viral 
protein is the major hemagglutinin of HCV-OC43. The receptor determinant recognized by this virus on the surface of 
erythrocytes is /V-acetyl-9-0-acetylneuraminic acid (Neu5,9Ac 2 ) which is also used by bovine coronavirus for attach¬ 
ment to cells. By analyzing erythrocytes containing different amounts of Neu5 r 9Ac 2 in either of two linkage types, it was 
found that there are subtle differences in the affinity of both viruses for 9-O-acetylated sialic acid. Bovine coronavirus 
was more efficient in recognizing low amounts of Neu5,9Ac 2 «2,3 linked to galactose, whereas HCV-OC43 was supe¬ 
rior with respect to the «2,6 linkage. The gene coding for the S protein of HCV-OC43 was cloned and sequenced. A 
large open reading frame predicts a polypeptide of 150 kDa in the unglycosylated form. A protein of about 190 kDa is 
expected if the 20 potential glycosylation sites are used for attachment of N-linked oligosaccharide side chains. These 
predictions were confirmed by in vitro transcription and translation of the gene in the presence or absence of canine 
pancreatic microsomal membranes. A high degree of sequence homology was found between the S proteins of HCV- 
OC43 and bovine coronavirus. Structural and functional analyses of more strains should help to identify the location of 
the sialic acid-binding site. © 1993 Academic Press, Inc. 

INTRODUCTION minal hydrophobic membrane anchor. The spike pro¬ 

teins of BCV, MHV, and IBV are cleaved by an intracel- 
Coronaviruses are a family of large, enveloped vi- lular protease into two subunits of comparable size, 

ruses, which contain a single-stranded RNA genome Furthermore the S protein is expected to have an over- 

of positive polarity with a size of about 30 kilobases. a || structure similar to the hemagglutinin of influenza A 

The virion contains two to three structural glycopro- virus (Wilson et at., 1981; De Groot et at., 1987; Ras¬ 
ters: the membrane protein (M); the spike, surface, or schaert and Laude, 1987). The S protein is a major 

peplomer protein (S); and the hemagglutinin/esterase target of the cellular immune response to coronavir- 

protein (HE). The presence of a HE protein is a charac- uses anc j p| a ys an important role in the initial stage of 

teristic feature of a serological subgroup of coronavi- infection. It mediates the attachment of the virus to the 

ruses comprising human coronavirus (HCV) strain ce || surface receptors and induces the fusion of the 

OC43, bovine coronavirus (BCV), hemagglutinating en- v j ra | anc j cellular membranes (reviewed by Spaan era/., 

cephalomyelitis virus (HEV), and mouse hepatitis virus 1 gg8) 

(MHV). Coronaviruses, which lack an HE protein and Different types of virus receptors have been identi- 

which have no antigenic relationship to the above- fied f 0r coronaviruses. A member of the carcinoem- 

mentioned viruses, include human coronavirus HCV- bryonic antigen family of proteins is a receptor for MHV 

229E, porcine transmissible gastroenteritis virus (Williams etai, 1991); TGEV and HCV-229E recognize 

(TGEV), and avian infectious bronchitis virus (IBV) (for a aminopeptidase N as a receptor (Delmas et at., 1992; 

review see Siddell et a!., 1983; Spaan et a!., 1988). Yeager et a!., 1992). BCV uses N-acetyl-9-O-acetyl- 

Human coronaviruses are known as respiratory patho- neuraminic acid as a receptor determinant to initiate 

gens; in addition, they have been associated with diar- infection (Schultze and Herrler, 1992). Results ob- 

rhea and multiple sclerosis (McIntosh, 1974; Resta et tained with erythrocytes indicate, that the attachment 

a/., 1985; Burks ef a/., 1980). 0 f HCV-OC43 and HEV is also dependent on the pres- 

The S protein is a typical class I membrane protein, ence of Neu5,9Ac 2 on the cell surface (Vlasak et a!., 

with an amino-terminal signal peptide, which is 1988; Schultze et at., 1990). Although the HE protein 

cleaved during protein processing and a carboxy-ter- has an affinjty for g-O-acetylated sialic acid, too, the S 

protein has been shown to be the major sialic acid- 

' To whom reprint requests should be addressed at Institut for bindin 3 P rotein of BCV (Schultze et at., 1991b). 
Viroiogie, Phiiipps-Universitat Marburg, Robert-Koch-Str. 17 , 3550 ^ the present report we show that the S protein is 

Marburg, Germany. also the major sialic acid-binding protein of HCV- 
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OC43. The spike protein was further analyzed (i) by 
determining the nucleotide sequence of the corre¬ 
sponding gene, (ii) by in vitro transcription and transla¬ 
tion of the cloned gene, and (iii) by analyzing the effi¬ 
ciency in recognizing Neu5,9Ac 2 . 


MATERIALS AND METHODS 
Viruses and cells 

HCV-OC43 was obtained from the American Type 
Culture Collection (ATCC) and passaged three times in 
human rectal tumor cells {HRT-18). Three days p.i. 
hemagglutination titers of 256 HAU/ml were deter¬ 
mined in the supernatant using 0.5% chicken erythro¬ 
cytes for HA-titration (Schultze et a!., 1990). HRT-18 
cells were obtained from ATCC and grown in Dul- 
becco’s minimal essential medium supplemented with 
10% fetal calf serum. 

Purification of virus 

Two to 4 days p.i. supernatants of infected cells 
were harvested and centrifuged for 30 min at 8,000 
rpm and 4°. Virus was pelleted from the clarified me¬ 
dium by centrifugation for 2.5 hr at 25,000 rpm and 4° 
using an SW28 rotor. The virus pellet was homoge¬ 
nized in phosphate-buffered saline deficient in magne¬ 
sium and calcium ions (PBS def.) by forcing it through a 
syringe needle and loaded on a discontinuous sucrose 
gradient consisting of 60, 50, and 30% sucrose in PBS 
def. After centrifugation for 1.5 hr at 40.000 rpm and 4° 
in an SW41 rotor, the virus was collected at the 30- 
50% interface. The virus was diluted with PBS def. and 
pelleted fori hr at 40,000 rpm and 4° in an SW41 rotor. 

Purification of viral glycoproteins 

The glycoproteins S and HE were isolated from the 
viral membrane by treatment with octylglucoside and 
purified by sucrose gradient centrifugation as de¬ 
scribed recently for BCV (Schultze eta!., 1991 a, b). The 
glycoproteins were analyzed for HA activity and acetyl- 
esterase activity as described recently (Schultze et at., 
1990, 1991b). 

Resialylation of erythrocytes 

Neu5,9Ac 2 was attached to cell surface glycoconju- 
gates in two linkage types by incubating erythrocytes 
from 1-day-old chicken with CMP-activated sialic acid 
and either of two sialyltransferases (Gal/31,3GalN Ac 
a2,3-sialyltransferase or Gal/31,4GlcNAc «2,6-sialyl- 
transferase, respectively; Boehringer-Mannheim) as 
described recently (Schultze eta!., 1990, 1992). 


Preparation of viral RNA 

The pellet of purified virus was homogenized in TE 
buffer, pH 7.6, and subjected to an overnight protein¬ 
ase K digestion (Sambrook et al., 1990). Viral RNA was 
extracted twice with phenol/chloroform 1/1 and twice 
with chloroform/isoamylalcohol 24/1 followed. The up¬ 
per phase was precipitated by 3 M NaAc, pH 5.3, and 
EtOH at -20° for 48 hr. After pelleting and washing 
with 70% EtOH, vRNA was dissolved in DEPC-treated 
water. 

DNA sequencing 

The cDNA of the cloned HCV-OC43 S gene was se¬ 
quenced with a modification of the dideoxy chain termi¬ 
nation method (Sanger et al., 1977) using Sequenase 
2.0 (USB). Sequencing primers were between 18 and 
26 nucleotides in length and were synthesized with an 
oligonucleotide synthesizer (Applied Biosystems). Part 
of them were designed on the basis of the published S 
gene sequence for BCV strain Mebus (Abraham et al., 
1990). The reaction products were analyzed on 5 and 
6% acrylamide gels containing 7 M urea. 

Cloning of the S gene 

The vRNA was transcribed into DNA using MoMuLV 
reverse transcriptase (Boehringer-Mannheim). Be¬ 
cause of the inability of the enzyme to produce a full- 
length cDNA, the S gene was transcribed in two over¬ 
lapping parts (Fig. 1). Primer Reb-1 binds to nucleo¬ 
tides 2244-2264 and Rbx-2 to the 3'-end of the S gene. 
Both of them contain at the 5'-end a BamH\ linker, Rbx- 
2 an additional Xho\ linker (Reb-1 5' CGCGGATCC- 
TACCTACTGTGAGATCACATG 3'; Rbx-2 5' CCTC- 
G AGGG ATCCC ACGAACTT AGTCGT CAT GT G 3 r ). 
First-strand cDNA synthesis was carried out in a vol¬ 
ume of 20 n\ containing 10 m M DTT, 6 m/W MgCl 2 , 1 
ng vRNA, 100 ^g/ml BSA, 50 m/WTris-HCI, pH 8.3, 40 
m M KCI, dNTP at 1 m M each, 0.75 tiM primer, 10 U 
MoMuLV reverse transcriptase. Denaturation for 3.5 
min was followed by annealing for 30 min at room tem¬ 
perature. Polymerization was carried out at 42° for 1 
hr. Afterward the enzyme was denatured by incubation 
at 99° for 7 min. This step was followed by PCR using 
the Gene Amp Kit and a thermal cycler (Perkin-Elmer 
Cetus) according to the manufacturer’s instructions. 

The sense primers were used at 0.15 nM and con¬ 
tain at their 5'-end a BamVW linker (Pbam-1) or an Xho\ 
linker (Pxho-2), respectively. (Pbam-1: 5' CGCGGA- 
TCCGCTGCATGATGCTTAGACCA 3'; Pxho-2: 5' 
CCTCGAGGACCAGCATTGCTATTTCGGAATAT 3'). 
There were 25 cycles performed each consisting of 
denaturation for 1 min at 95°, annealing for 1 min at 
59° and a 3 min extension step at 72°. In each cycle 
the polymerization step was automatically extended 
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Fig. 1 . Flow chart of the procedure chosen for cloning the S gene 
of HCV-OC43. 


for 2 sec. Five independent reverse transcription and 
PCR reactions were done and mixed afterward. To re¬ 
move the Taq polymerase which might have been 
bound to the reaction products {Crowe et ai, 1991), 
the whole mixture was digested with proteinase K (17 
jug/m!) for 1 hr at 37°, extracted with phenol/chloro- 
form/isoamylalcohol (50/48/2), and precipitated. After 
digestion with BamVW or Xhol, respectively, the prod¬ 
ucts were separated on a 1% agarose gel and eluted 
using Quiaex (Diagen). The two S gene fragments were 
subcloned into pBluescript (Stratagene). Strain XI-1 
Blue (Stratagene) of Escherichia coli was transformed 
with this plasmid using standard procedures. The two 
overlapping inserts were prepared by digesting the re¬ 
combinant plasmids with either BamVW or Xhol, respec¬ 
tively, followed by agarose gel electrophoresis and elu¬ 
tion. They were dephosphorylated with 8 units calf in¬ 
testinal alkaline phosphatase (Boehringer-Mannheim) 
for 1 hr at 37°, extracted with phenol/chloroform/iso- 
amylalcohol, and precipitated. Then both fragments 
were blunt-ended with Pvull (Fig. 1) and subjected a 
second time to electrophoresis and elution. Both frag¬ 
ments were ligated with 12.5 units T4-ligase (Boeh- 
ringer-Mannheim) for 12 hr at 14°. An aliquot of this 
ligation mixture was incubated with BamHl- and Xhol- 


digested pBluescript and 5 units T4-ligase at 16° for an 
additional 12 hr. After transformation, three positive 
clones were sequenced at both ends of the S gene 
(300 nucleotides in both directions). No nucleotide dif¬ 
ference was found. One of the positive clones was se¬ 
quenced completely in both directions. 

In vitro transcription and translation 

The recombinant plasmid pBSKS + /atS8 was linear¬ 
ized with Xhol, digested with proteinase K, extracted 
with phenol/chloroform, precipitated, and dissolved in 
DEPC-treated TE-buffer, pH 7.5, at 1 nQlfil. In vitro 
transcription was done in a total volume of 25 *tl. using 
2 ^g of linearized plasmid DNA and the mCAP kit (Stra¬ 
tagene), which yielded capped mRNAs. These mRNAs 
were subsequently treated for 5 min at 37° with 10 
units RNase-free DNasel and then extracted once with 
phenol/chloroform. After precipitation the mRNA was 
resuspended in TE buffer. In vitro translation was per¬ 
formed with a commercial kit (IN VITRO EXPRESS, 
Stratagene), which consisted of a pretreated white rab¬ 
bit reticulocyte lysate. The reaction was performed ac¬ 
cording to the manufacturer’s instructions with 1 -3 ng 
mRNA and 35 S-methionine. To obtain proteins which 
were processed and core-glycosylated, translation re¬ 
actions were done using canine pancreatic micro¬ 
somal membranes (Promega). The a-mating factor 
from S. cerevisiae served as a control for signal pro¬ 
cessing and the /3-lactamase from E. coli as a control 
for glycosylation (data not shown). 

lmmunoprecipitation 

After translation, the proteins were immunoprecipi- 
tated with a rabbit antisera directed against the S pro¬ 
tein of BCV. Incubation for 1 hr on ice was followed by 
incubation for 30 min at 4° with protein A-Sepharose 
(Serva) on a shaker and three cycles of washing at 4° 
with PBS def. Afterward, the immunoprecipitate was 
resuspended in sample buffer containing 100 mlW 
DTT, heated to 95°, subjected to SDS-PAGE on 8% 
gels (Laemmli, 1970), and prepared for fluorography. 

RESULTS 

The S protein of HCV-OC43 is a hemagglutinin 

In order to assign the hemagglutinating activity of 
HCV-OC43 to one of the surface proteins, the glyco¬ 
proteins were isolated in purified form. The membrane 
proteins were solubilized by detergent treatment and 
separated by sucrose gradient centrifugation as de¬ 
scribed recently for BCV (Schultze et at., 1991 a,b). A 
peak of the S protein was detected in fraction 3, 
whereas HE was recovered from fraction 6. A similar 
sedimentation behavior has been reported for the gly¬ 
coproteins of BCV (Schultze eta!., 1991b). These frac- 
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TABLE 1 

Analysis of the Glycoproteins S ano HE of HCV-OC43 for 
Acetylesterase and Hemagglutinating Activity 



HA activity 

Acetylesterase 

Glycoprotein 3 

(HA-units/ml) 

(OD405) 

S protein 

1024 

<0.01 

HE protein 

<2 

0.56 


3 The glycoproteins of HCV-OC43 were purified by sucrose gra¬ 
dient centrifugation. Fractions 3 and 6 representing the peak frac¬ 
tions of S and HE, respectively, were dialyzed and analyzed for HA 
and esterase activity. 

tions were analyzed for acetylesterase and hemaggluti- 
nating activity. As shown in Table 1, only fraction 6 was 
able to cleave p-nitrophenyl acetate, confirming that 
the HE protein is the esterase of HCV-OC43. This pro¬ 
tein was, however, unable to agglutinate chicken eryth¬ 
rocytes. On the other hand, the S protein present in 
fraction 3 was found to be a very potent agglutinating 
agent. This result indicates that the S protein is the 
major hemagglutinin of HCV-OC43. This finding is in 
agreement with the data reported for BCV (Schultze et 
a/., 1991 b). A difference between both viruses was ob¬ 
served in the electrophoretic behavior of the S protein. 
While the surface protein of BCV is cleaved by a cellu¬ 
lar protease into two subunits, Si and S2, the glyco¬ 
protein of HCV-OC43 is present on virions in the un¬ 
cleaved form (not shown). As has been shown previ¬ 
ously, cleavage is possible by incubation with trypsin 
(Hogue and Brian, 1986). 

Comparison of the sialic acid-binding activity of 
HCV-OC43 and BCV 

The sensitivity of the erythrocyte receptors for HCV- 
OC43 to acetylesterase suggested that 9-O-acetylated 
sialic acid is a receptor determinant for this virus (Vla- 
sak et a!., 1988). Therefore, it can be concluded from 
the data presented in Table 1 that the S protein is not 
only the major hemagglutinin but also the major sialic 
acid-binding protein. Direct evidence for the impor¬ 
tance of Neu5,9Ac 2 was obtained by resialylation of 
erythrocytes from 1-day-old chicken. These cells lack 
9-O-acetylated sialic acid and are resistant to agglutina¬ 
tion by HCV-OC43. Neu5,9Ac 2 was attached to the 
cell surface by incubation of neuraminidase-treated 
erythrocytes with sialyltransferase and CMP-activated 
sialic acid. In order to determine whether there is a 
preference of the virus for a certain linkage type, two 
different transferases were used, which allow to 
compare the following oligosaccharide structures: 
l\leu5,9Ac 2 a2,3Gal|81,3GalNAc, and Neu5,9Ac 2 «2, 
6Gal/31,4GlcNAc, respectively. A further variation was 
obtained by applying different concentrations of CMP- 


sialic acid. In this way batches of erythrocytes were 
obtained which differed in their content of Neu5,9Ac 2 . 
In the range of concentrations used in the present 
work, a proportional amount of Neu5,9Ac 2 is trans¬ 
ferred to the cell surface (Herrler et al., 1992). As 
shown in Table 2, HCV-OC43 was able to recognize 
9-O-acetylated sialic acid in both the «2,3 and the a2,6 
linkage. The exact amount of sialic acid transferred 
was not determined, because CMP-Neu5,9Ac 2 was 
not available in radioactive form. Using erythrocyte 
preparations with different amounts of 9-O-acetylated 
sialic acid on the surface, it was possible to compare 
the efficiency of different viruses in recognizing 
Neu5,9Ac 2 . HCV-OC43 was found to require less si¬ 
alic acid in the a2,6-linkage than BCV for agglutination 
of erythrocytes. The a2, 3-linkage, on the other hand, 
was recognized more efficiently by BCV. As far as the 
preference for one of the two linkages is concerned, 
HCV-OC43 was found to resemble influenza C virus 
(strain Johannesburg/1/66) more than BCV (Table 2). 

Sequence analysis of the S gene 

To determine the sequence of the S gene of HCV- 
OC43, viral RNA was isolated, transcribed into cDNA, 
and amplified by PCR. The DNA obtained was cloned 


TABLE 2 


Comparison of the Efficiency of HCV-OC43, BCV, and Influenza 
C Virus (Johannesburg/ 1/66) in Recognizing 9-O-Acetylated Sialic 
Acid as a Receptor Determinant 



CMP-$ialic 

acid 3 

(nmol) 

HA activity (HA units/ml) 

Linkage type 

BCV 

HCV-OC43 

Influenza 

C 

<*2,3 

4 

1024 

128 

512 


2 

512 

32 

128 


1 

64 

<2 

<2 


0.5 

16 

<2 

<2 

«2,6 

8 

512 

128 

512 


4 

128 

64 

512 


2 

<2 

64 

512 


1 

<2 

32 

512 


0.5 

<2 

32 

512 


0.25 

<2 

32 

512 

Erythrocytes from 

adult chicken 

—- 

1024 

128 

1024 


a Neuraminidase-treated erythrocytes from 1-day-old chicken 
were incubated as a 50% suspension in a volume of 100 with 
either 1,5 mU of Gal/51 1 3GalNAc <*2,3 or 5 mil of Gal£1, 4GlcNAc 
tv2,6-sialyltransferase and the amount of CMP-activated Neu5, 9Ac 2 
indicated. The resialylated cells were used to determine the HA titer 
of BCV, HCV-OC43, and influenza C virus. For comparison the HA 
titer of the virus suspensions obtained with untreated erythrocytes 
from adult chicken is given at the bottom of the table. 
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2031 

1 AAAC ATQTTTTTGATACTTTTAATTTCCTTACCAACOGCTTTTGCTGTTATAGGA GATTTAAAGTGTACT 2101 

71 TCAGATACTAGTTATATTAATGATAAAGACACCGGTC CTCCTCCTATAAGTACTGATACTGTTGATGTTA 2171 

141 CTAATGGTTTGGGTACTTATTATGTTTTAGATCGTGTGTATTTAAATACTACGTTTTTGCTTAATGGTTA 224 2 

211 TTACCCTACTTCAGGTTCCACATATCGTAATATGGCACTGAAGGGAACTGTACTATTGAGCACACTATGG 2 311 

281 TTTAAACCACCATTTCITTCTGATTTTATTAATGGTATTTTTGCTAAGGTCAAAAATACCAAGGTTATTA 2381 

351 AAGATGGTGTAATGTATAGTGAGTTCCCTGCTATAACTATAGGTAGTACTTTTGTAAATACATCCTATAG 24 51 

421 TGTGGTAGTACAACCACGTACAATCAATTCAACACAGGATGGTTATAATAAATTACAAGGTCTTTTAGAG 2521 

491 GTCTCTGTTTGCCACTATAATATGTGCGAGTACCCACAAACGATTTCTCATCCTAACCTGGGTAATCATC 2591 

561 GCAAAGAACTATGG CATTTGGATACAGGTGTTGTTTCCTGTTTATATAAG CGTAATTTCACATATGATGT 2661 

631 G AATG CTG ATT ATTTGT ATTTTCAT'ITTT ATC AAGAAGGTGGT ACTTTTTATGCAT ATTTT A C AG AC ACT 27 31 

701 GGTGTTCTTACTAAGTTTTTGTTTAATGTTTATTTAGGCATCGCGCTTTCACACTATTATGTCATGCCTC 2801 

771 TGACTTGTAATAGTAAGGTTAAGAATGGTTTTACTTTAGAATATTGGGTTACTCTCTCACTTCTAGACAA 2871 

841 TATTTACTCGCTTTCAATCAAGATGGTATTATTTTTAATGCTGTTGATTGTATGAGTGATTTTATGAGTG 2941 

911 AGATTAAGTGTAAAACACAATCTATAGCACCACCTACTGGTGTTTATGAATTAAACGGTTACACTGTTCA 3011 

9 81 GCCAATC G C AG ATGTTT ACCG ACGT AAACCT AATCTTCCC AATTGC AAT AT AG AAG CTTGGCTT AATGATA 3081 

1051 AGTCGGTG CCCTCTC CATT AAATTGGG AACGT AAGACATTTT C AAATTGTAATTTTAAT ATGAGCAGCCT 3151 

1121 GATGTCTTTTATTCAGGCAGACTCATTTACTTGTAATAATATTGATGCTGCTAAGATATATGGTATGTGT 3221 

1191 TTTTCCAGCATAACTATAGATAAGTTTGCTATACCCAATGGCAGGAAGGTTGACCTACAATTGGGTAATT 3291 

1261 TGGGCTATTTG CAGTCATTTAACTATAGAATTGATACTACTG CAACAAGTTGTCAGTTGTATTATAATTT 3361 

1331 ACCTG CTG CT AATGTTTCTGTT AG C AGGTTTAATCCTTCTACTTGGAAT AAG AG ATTTGGTTTTATAG AA 34 31 

14 01 G ATTCTGTTTTTAAGCCTCG ACCTG C AGGTGTTCTTACT AATCATGATGT AGTTTATGC AC AAC ACTCTT 3 501 

1471 TC AAAG CTCCTAAAAATTTCTGT CCGTGTAAATTG AATGGTTCGTGTGTAGGT AGTGGTC CTGCTAAAAA 3 571 

1541 TAATGGTATAGGCACTTGTCCTGC AGGTACT AATTATTTAACTTGTGAT AATTTGTG CACT CCTGATCCT 3641 

1611 ATTACATTT AAAGCTACAGGTACTTAT AAGTGCCCCCAAACTAAATCTTTAGTTGGCATAGGTGAGCACT 3711 

1681 GTTCGGGTCTTGCTGTTAAAAGTGATTATTGTGGAGGCAATTCTTGTACTTGCCGACCACAAGCATTTTT 37 81 

1751 GGGTTGGTCTGCAGACTCTTGTTTACAAGGAGACAAGTGTAATATTTTTGCTAATTTTATTTTGCATGAT 3851 

1821 GTTAATAGTGGTCTTACTTGTTCTACTGATTTACAAAAAGCTAACACAGACATAATTCTTGGTGTTTGTG 3921 

1891 TTAATTATGA CCT CTATGGTATTTTAGGCCAAGGCATTTTTGTTG AGGTTAATG CGACTTATTAT AATAG 39 91 

19 61 TTGGCAGAACCTTTTATATGATTCTAATGGTAATCTCTACGGTTTTAGAGACTACATAACAAACAGAACT 4061 


TTTATGATTCGTAGTTG CTATAG CGCTCGTCTTTCTG CGGCCTTT CACGCTAACT CTTC CG AACCAGCAT 

TG CTATCTCGC AATATTAAATCCAACTACGTTTTTAATAATAGTCTTACACGA CAGCTGCAACC CATTAA 

CTATTTTGATAGTTAT CTTGGTTGTGTTGTCAATGCTTATAATAGTACTG CTATTTTTGTTCAAACATGT 

GATCTCACAGTAGGTAGTGGTTACTGTGTGGATTACTCTAAAAAC AGACOAAGTCGTGGA GCGATTACCA 

CTGGTTATCGGTTTACTAATTTTGAGCCATTTACCGTTAATTCAGTAAACGATAGTTTAGAACCTGTAGG 

TGGTTTGTATGAAATTCAAATACCTTCAGAGTTTACTATACGTAATATGGAGGAGTTTATTCAAACAAGC 

T CTCCT AAAGTTACTATTGATTCTGCTGCATTTGTCTGTGGTGATTATGCAGCATGTAAATCACAGTTGG 

TTGGATATGGTAGTTTCTGTGATAACATTAATGCGATACTCACAGAAGTAAATGAACTACTTGACACTAC 

ACAGTTGCAAGTAGCTAATAGTTTAATGAATGGTGTTACTCTTAGCACTAAGCTTAAAGATGGCGTTAAT 

TTCAATGTAGACGACATCAATTTTTCCCCTGTATTAGGTTGTCTAGGCAGCGAATGTAGTAAAGCTTCCA 

GTAGATCTGCTATAGAGGATTTACTTTTTGATAAAGTAAAGTTATCTGATGTCGGTTTTGTTGAGGCTTA 

TAATAATTGTACAGGAGGTGCCGAAATTAGGGACCTCATTTGTGTGCAAAGTTATAAAGGCATCAAAGTG 

TTGCCTCCACTG CTCTCAG AAAATCAGATCAGTGGATACACTTTGGCTG CCAC CTCTGCTAGTCTATTTC 

CTCCTTGGACTGCAGCAGCAGGTGTACCATTTTATTTAAATGTTCAGTATCGCATTAATGGGCTTGGTGT 

CAC CATGGATGTGCTAAGTCAAAATCAAAAGCTTATTGCTAATGCATTTAACAATGCC CTTTATGCTATT 

CAGG AAGGGTTCGATG C AA CCAATTCTGCTTTAGTT AAAATTCAAG CTGTTGTTAATGCAAATGCTGAAG 

CTCTTAATAACTTATTGCAACAACTCTCTAATAGATTTGGTCCTATAAGTGCTTCTTTACAAGAAATTCT 

ATCTAGACTTGATGCTCTTGAAGCGGAAGCTCAGATAGATAGACTTATTAATGGTCGTCTTACCGCTCTT 

AATG CTTATGTTTCTC AACAG CTT AGTGATT CTACACTGGT AAAATTTAGTGCAG CACAAGCTATGGAGA 

AGGTTAATG AATGTGTC AAAAG CCAAT CATCTAGGAT AAA TTTCTGTGGTAATGGTAATCATATTATATC 

ATTAGTGCAG AATG CTCCATATGG'TTTGTAT'TTTATC CACTTT AGTTATGTCCCTACT AAGT ATGTCACA 

GCGAAGGTTAGTCCCGGTCTGTGCATTGCTGGTGATAGAGGTATAGCTCCTAAGAGTGGTTATTTTGTTA 

ATGTAAATAATACTTGGATGTACACTGGTAGTGGTTACTACTACCTTGAACCTATAACTGAAAATAATGT 

TGTTGTTATGAGTACCTGGGCTGTTAATTATACTAAAGCGCCGTATGTAATGCTGAACACTTCAATACCC 

AACCTTCATGATTTTAAGGAAGAGTTGGATCAATGGTTTAAAAATCAAACATCAGTGGCACCAGATTTGT 

CACTTGATTATATAAATGTTACATTCTTGGACCTACAAGTTGAAATGAATAGGTTACAGGAGGCAATAAA 

AGTCTTAAATCAGAGCTACATCAATCTCAAGGACATTGGTACATATGAATATTATGTAAAATGGCCTTGG 

TATGTATGGC TTTTAATCTQCCTTGCTGQTGTAGCTATGCTTGTTTTACTATTCTTCATA TGCTGTTGTA 

CAGGATGTGGGACTAGTTGTTTTAAGAAATGTGGTGGTTGTTGTGATGATTATACTGGATACCAGGAGTT 

AGTAATCAAAACTTCACATGACGACTAA 


Fig. 2. Nucleotide sequence of the S gene from HCV-OC43. Sequences coding for the N-terminal signal peptide and the C-termina! membrane 
anchor, respectively, are underlined and printed in bold letters. The sequence coding for the potential cleavage site is bold-printed and 
underlined twice. The predicted polymorphic region in the SI subunit of BCV (Yoo et a!., 1991) is indicated by a wavy line. 


in two steps into pBiuescript and sequenced. The nu¬ 
cleotide sequence determined for the S gene of HCV- 
0043 contains a single large open reading frame 
(ORF), which extends from the first ATG codon (nu¬ 
cleotides 5-7) to the TAA stop codon at nucleotides 
4086-4088 (Figs. 2 and 3). This ORF has a coding 
capacity of 1361 amino acids corresponding to a M r of 
150 kDa in the unprocessed form. There are 20 poten¬ 
tial sites for N-glycosyiation in the sequence of HCV- 
OC43 in contrast to 19 in the S protein of BCV-Mebus 
(Abraham et ai, 1990). The predicted N-glycosylation 
site which is missing in BCV is located at amino acid 
position 502-504. Assuming a mean M f of 2100 Da 
per carbohydrate side chain (Hunter et ai., 1983), a M r 
of about 190 kDa is expected for the glycosylated pro¬ 
tein. Like the corresponding proteins of other corona- 
viruses, the S protein of HCV-OC43 shows the charac¬ 
teristics of a typical class I membrane protein. It has an 
N-termina) signal peptide with a hydrophobic core of 
13 amino acids and a helix-breaking glycine at position 
17 (Watson, 1984), suggesting that the signal pepti¬ 
dase cleaves between amino acids 17 and 18 (von 
Heijne, 1986). Close to the C-terminus, there is a 
stretch of hydrophobic amino acids (1322 to 1338) 
which is predicted to form an a-helix and to function as 
a membrane anchor (Chou-Fassman prediction; HU- 


SAR-Program, EMBL Heidelberg). By comparison with 
the S protein of related coronaviruses, the sequence 
772-Arg-Arg-Ser-Arg-Gly-778 is suggested to be the 
site for the proteolytic cleavage into the subunits Si 
and S2 (Fig. 3). This sequence is not identical to the 
proteolytic cleavage site determined for different 
strains of BCV and MHV, which contain either two 
pairs of arginines or a single arginine separated by two 
nonbasic amino acids from a pair of arginines (Schmidt 
etai., 1987; Luytjes et ai., 1988; Boireau etai, 1990; 
Parker et ai, 1990; Abraham etai, 1990). 

HCV-OC43 has 120 amino acids, 92 in SI and 28 in 
S2, which differ from the S protein of BCV-Mebus 
(Abraham et ai., 1990) including insertions and dele¬ 
tions. The resulting amino acid homology for Si is 88% 
and for the S2 subunit 95.3%. The insertions found in 
HCV-OC43 are located within the first 256 amino 
acids, whereas deletions were found within amino 
acids 494-531. To see whether the cloned S gene is in 
a functional form, in vitro transcription/translation ex¬ 
periments were performed. The capped mRNAs ob¬ 
tained with the transcription kit were incubated with 
white rabbit reticulocyte lysates in the presence or ab¬ 
sence of canine pancreatic microsomal membranes. 
[ 35 S] Methionine-labeled proteins were immunoprecipi- 
tated with a polyclonal antiserum directed against the 
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S protein of BCV and analyzed on 8% SDS-polyacryl- 
amide gels (Fig. 4). Translation products were obtained 
with mRNAs derived from plasmids containing either 
the intact S gene or a fragment coding for Si. The 
faster migrating bands probably represent premature 
reaction products. The sizes of the largest proteins de¬ 
tected (indicated by arrows) are in good agreement 
with the molecular weights calculated for the glycosy¬ 
lated and unglycosylated form of the S protein (see 
above) and the Si protein, respectively. Cleavage of S 
into the two subunits Si and S2 was not observed in 
this system. These results indicate that the cloned S 
gene of HCV-OC43 is in a functional form. 


DISCUSSION 

A high degree of homology was found between the S 
genes of HCV-OC43 and BCV. The overall relatedness 
between these two viruses had already been indicated 
by oligonucleotide fingerprinting which predicted a se¬ 
quence homology of greater than 96% (Lapps and 
Brian, 1985). Compared to the N protein with 97.5% 
(Kamahora et a!., 1989) and the HE protein with 95% 
amino acid homology (Zhang eta!., 1992), the S protein 
of HCV-OC43 is less conserved (91.2%), especially the 

51 subunit with a homology of 88%. The fact that the 

52 portion of the molecule is more conserved than Si 
was also found in other coronaviruses, e.g., in IBV, 
MHV, and FIPV (for a review see Spaan et ah, 1988). 
Interestingly, HCV-OC43 shows insertions and dele¬ 
tions in the Si part of the molecule, which, however, 
counterbalance each other and lead only to a loss of 
two amino acids compared to BCV. The deletions are 
all located in a part of Si (amino acids 455-599), which 
in the case of BCV has been described as a polymor¬ 
phic region (Yoo ef a/., 1991). It is the same region 
which in MHV-JHM is almost completely and in MHV- 
A59 partly missing (Schmidt ef a/., 1987; Luytjes etal., 
1988). Taken all the sequence changes in the S protein 
of HCV-OC43 and BCV together, 25.5% of the amino 
acids are different within the polymorphic region com¬ 
pared to 12.2% in the rest of the Si subunit and only 
7.7% in S2. 

The S protein of most strains of BCV and MHV is 
cleaved into the subunits Si and S2 by a cellular pro¬ 
tease acting at the C-terminus of the sequence N-Arg- 
Arg-Xxx-Arg-A.rg-C (Schmidt et at., 1987; Luytjes ef a/., 
1988; Boireau eta!., 1990; Parker ef a/., 1990; Abra¬ 
ham era/., 1990; Zhang etal., 1990). The substitution 
of Gly for Arg at position -1 of the cleavage site of HCV- 
OC43 may be responsible for the reduced cleavability 
reported for the spike protein of this virus (Hogue and 
Brian, 1986). We also observed that S was only cleaved 
to an extent of 10%, when OC43 was propagated in 
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Fig. 3. The predicted amino acid sequence of the S protein from 
HCV-OC43 compared to strain Mebus of BCV (Abraham ef a/., 
1990). For HCV-OC43, the complete sequence is presented; for 
BCV, only the amino acid differences are shown. The amino-terminal 
signal peptide and the carboxy-terminal membrane anchoring do¬ 
main are underlined and printed in bold letters. Potential sites for 
N-glycosylation (Asn-Xxx-Ser/Thr) are indicated in the same way. 
Amino acids which are inserted or missing are marked by asterisks 
and the predicted polymorphic region in the Si -subunit of BCV (Yoo 
etal., 1991) is marked by a wavy line. The proteolytic cleavage site is 
printed in bold letters and underlined twice. 

HRT-18 cells in contrast to BCV grown in this cell line, 
which showed nearly 80% cleavage of the S protein 
(data not shown). In the case of influenza viruses, the 
sequence Arg-Xxx-Lys/Arg-Arg has been determined 
as the consensus motif for the cleavage of the hemag- 
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Fig. 4. Analysis of the polypeptides obtained by in vitro transcrip- 
tion/translation of the S gene of HCV-OC43 and a fragment coding 
for the S1 subunit. The 35 -methionme-!abeled proteins were immuno- 
precipitated with a polyclonal antiserum directed against the S pro¬ 
tein of BCV and subjected to SDS-PAGE under denaturing condi¬ 
tions. The samples incubated in the presence of canine pancreatic 
microsomal membranes are indicated by (+). The size of molecular 
weight markers (kDa) is indicated on the left. 

glutinin by the cellular protease furin (Vey et al, 1992; 
Stienecke-GrOber et ai, 1992). This consensus se¬ 
quence can also account for the differential cleavage 
of the S protein of HCV-OC43, BCV, and MHV, respec¬ 
tively. The proteolytic cleavage of the influenza hemag¬ 
glutinin is an absolute requirement for the fusion activ¬ 
ity, and consequently for the infectivity, of the virus. As 
far as coronaviruses are concerned, proteases may 
have an enhancing effect (St. Cyr-Coats er at ., 1988). 
However, fusion activity and infectivity appear not to be 
strictly dependent on the proteolytic cleavage of S. The 
S protein of TGEV and related viruses completely lacks 
a motif for proteolytic cleavage and is only found in the 
uncleaved form (Rasschaert and Laude, 1987). More¬ 
over, after site-directed mutagenesis of the cloned S 
gene, the surface protein of MHV was obtained in an 
uncleaved form, which nevertheless was fusion-active 
(Stauber et al, 1993). 

We presented evidence that the S protein of HCV- 
OC43 is a hemagglutinin which interacts with 
Neu5,9Ac 2 present on the surface of erythrocytes. The 
HE protein was found to be unable to agglutinate 
chicken erythrocytes. Similar findings have been re¬ 
ported recently for BCV (Schultze eta!., 1991b), where 
the HE protein only agglutinates mouse or rat erythro¬ 
cytes which are very rich in surface-bound 9-O-acety- 
lated sialic acid. Thus, the S protein is the major sialic 
acid-binding protein for both HCV-OC43 and BCV. This 
may not be surprising because of the sequence similar¬ 
ity. There are, however, subtle differences in the affinity 
of both viruses for sialic acid. While BCV is more effi¬ 
cient in recognizing Neu5,9Ac 2 in an «2,3-linkage, 
HCV-OC43 is more effective toward an «2,6-linkage. In 


this respect, HCV-OC43 was similar to influenza C 
virus. As both viruses infect humans, one might specu¬ 
late that the difference between these viruses and BCV 
in the preference for a certain linkage type is related to 
the host tropism. However, more strains have to be 
analyzed for conclusive evidence. Assuming that the 
binding site for 9-0-acety)ated sialic acid is part of the 
Si subunit, there are 92 amino acid changes which 
could be responsible for the difference between HCV- 
OC43 and BCV. This number is too high to allow a 
conclusion about the receptor-binding site. However, 
by extending this approach to more strains, it should 
be possible to get a clue about the location of the bind¬ 
ing site for 9-O-acetylated sialic acid. 
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